Consistent Classification Algorithms for Multi-class Non-Decomposable Performance Metrics
نویسندگان
چکیده
We study consistency of learning algorithms for a multi-class performance metric that is anon-decomposable function of the confusion matrix of a classifier and cannot be expressed asa sum of losses on individual data points; examples of such performance metrics include themicro and macro F-measure used widely in information retrieval and the multi-class G-meanmetric popular in class-imbalanced problems. While there has been much work in recent years inunderstanding the consistency properties of learning algorithms for ‘binary’ non-decomposablemetrics [1, 2, 3, 4, 5], little is known either about the form of the optimal classifier for a generalmulti-class non-decomposable metric, or about how these learning algorithms generalize to themulti-class case. In this paper, we provide a unified framework for analyzing a multi-class non-decomposable performance metric, where the problem of finding the optimal classifier for theperformance metric is viewed as an optimization problem over the space of all confusion matricesachievable under the given distribution. Using this framework, we show that (under a continu-ous distribution) the optimal classifier for a multi-class performance metric can be obtained asthe solution of a cost-sensitive classification problem, thus generalizing several previous resultson specific binary non-decomposable metrics. We then design a consistent learning algorithmfor concave multi-class performance metrics that proceeds via a sequence of cost-sensitive clas-sification problems, and can be seen as applying the conditional gradient (CG) optimizationmethod over the space of feasible confusion matrices. To our knowledge, this is the first efficientlearning algorithm (whose running time is polynomial in the number of classes) that is provablyconsistent for a large family of multi-class non-decomposable metrics. Our consistency resultmakes use of a novel proof technique based on the convergence analysis of the CG method.
منابع مشابه
Optimal Decision-Theoretic Classification Using Non-Decomposable Performance Metrics
We provide a general theoretical analysis of expected out-of-sample utility, also referred to as decisiontheoretic classification, for non-decomposable binary classification metrics such as F-measure and Jaccard coefficient. Our key result is that the expected out-of-sample utility for many performance metrics is provably optimized by a classifier which is equivalent to a signed thresholding of...
متن کاملMulti-dimensional classification with Bayesian networks
Multi-dimensional classification aims at finding a function that assigns a vector of class values to a given vector of features. In this paper, this problem is tackled by a general family of models, called multi-dimensional Bayesian network classifiers (MBCs). This probabilistic graphical model organizes class and feature variables as three different subgraphs: class subgraph, feature subgraph,...
متن کاملDesigning a New Multi-objective Model for a Forward/Reverse Logistic Network Considering Customer Responsiveness and Quality Level
In today’s competitive world, the need to supply chain management (SCM) is more than ever. Since the purpose of logistic problems is minimizing the costs of organization to create favorable time and place for the products, SCM seek to create competitive advantage for their organizations and increase their productivity. This paper proposes a new multi-objective model for integrated forward / rev...
متن کاملNegative Selection Based Data Classification with Flexible Boundaries
One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...
متن کاملRegret Bounds for Non-decomposable Metrics with Missing Labels
We consider the problem of recommending relevant labels (items) for a given data point (user). In particular, we are interested in the practically important setting where the evaluation is with respect to non-decomposable (over labels) performance metrics like the F1 measure, and the training data has missing labels. To this end, we propose a generic framework that given a performance metric Ψ,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1501.00287 شماره
صفحات -
تاریخ انتشار 2015